straightforward application
Reviews: LIIR: Learning Individual Intrinsic Reward in Multi-Agent Reinforcement Learning
Overall, the method provided is a straightforward application of a known IR method to MARL, the results are promising and the writing is clear. As such, this work has limited novelty but provides good empirical contributions, though these too could be improved by considering more domains. A more detailed review of the paper, along with feedback and clarifications required are provided below. The work is motivated by the claim that providing individual IRs to different agents in a population (in a MARL setting) will allow diverse behaviours. The analysis at the end of the paper shows that a lot of the learned IR curves do overlap.
learning individual intrinsic reward, multi-agent reinforcement learning, straightforward application, (3 more...)
Technology: